METHOD AND APPARATUS FOR SOLVING KEY EQUATION 
POLYNOMIALS IN DECODING ERROR CORRECTION CODES 



BACKGROUND OF THE INVENTION 
In the transmission of data from a source location to a destination 

5 location through a variety of media, noise caused by the transmission path 
and/or the media itself causes errors in the transmitted data. Thus, the data 
transmitted is not the same as the data received. In order to determine the 
errors in the received data, various methods and techniques have been 
developed to detect and correct the errors in the received data. One of the 

10 methods is to generate a codeword which includes a message part (data to 
be transmitted) and a parity part (information for performing error 
correction). 

Among the most well-known error-correcting codes, the BCH 
(Bose-Chaudhuri-Hocquenghen) codes and the RS (Reed-Solomon) codes 
15 are the most widely used block codes in the communication field and 
storage systems. The mathematical basis of BCH and RS codes is 
explained by: E.R. Berlekamp, Algebraic Coding Theory, McGraw-Hill, 
New York, 1968; and Richard E. Blahut, Theory and Practice of Error 
Control Codes, Addison- Wesley, 1983. 
20 An (N, K) BCH or RS code has K message symbols and N coded 

symbols, where each symbol belongs to GF(q) for a BCH code or GF(q m ) 
for a RS code. A binary (N, K) BCH code can correct up to t errors with 
N=2m-1, N-K<=mt. An (N, K) RS code can correct up to t errors and p 
erasures with t=| n-k-p I For binary BCH codes, an error can be corrected 



25 simply by finding out the error location. For RS codes, an error can be 
corrected by finding out the error location and the error value. In RS codes, 
an erasure is defined to be an error with a known error location, and hence 
its correction reduces to finding the error value. 



The method steps for common popular RS decoder architectures for 



30 the correction of errors can be summarized into four steps: (1) calculating 
the syndromes from the received codewords, (2) computing the error 
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locator polynomial and the error evaluator polynomial, (3) finding the error 
locations, and (4) computing error values. If both errors and erasures and 
corrected, the four steps are modified to: (1) calculating the Forney 
syndromes from the received codewords and the erasure locations, (2) 
5 computing the errata locator polynomial and the errata evaluator 
polynomial, (3) finding the errata locations, and (4) computing the errata 
values. 

Referring to Fig. 1 , the general decoding steps are illustrated. Note that 
for simplification, the error-only RS decoder is introduced. The received 

10 data, R(x), is provided to a syndrome generator 101 to generate a syndrome 
polynomial, S(x), representing the error pattern of the codeword from 
which the errors can be corrected. The syndrome is then provided to a key 
equation solver 102 to generate an error locator polynomial, cj(x), and an 
error evaluator polynomial, Q(x). The error locator polynomial indicates 

15 the location(s) of the error and the error evaluator polynomial indicates the 
amount of the error. In the next step, the error locator polynomial is passed 
to a Chien search engine 103 to generate the root(s), pi, representing the 
location(s) of the errors. Then the error evaluator 104, receiving the root(s) 
and the error evaluator polynomial, Q(x), calculates the error value(s) of 

20 the root(s) 

The second step in the above-mentioned four-step procedure involves 
solving the key equation, which is 

S(x) ct(x)= Q(x) mod x N " K (1) 
where S(x) is the syndrome polynomial, a(x) is the error locator 
25 polynomial and Q(x) is the error evaluator polynomial. When both errors 
and erasures are corrected, o(x) and Q(x) are the errata locator polynomial 
and the errata evaluator polynomial, respectively. In addition, the errata 
locator polynomial a(x) becomes the product of A,(x) and A(x) 
corresponding to the error locator polynomial and the erasure locator 
30 polynomial, respectively. 

The techniques frequently used to solve the key equation (1) include 
the Berlekamp-Massey algorithm and the Euclidean algorithm. The 
extension of these algorithms to correct both errors and erasures can be 
found in the Blahut article cited above. Here a novel invertionless 
35 decomposed Euclidean architecture is invented to reduce the hardware 
complexity drastically while maintaining the over all decoding speed. 

Prior art technologies applied the traditional Euclidean algorithm (or 
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variation thereof) for the calculation of the error locator polynomial and 
the error evaluator polynomial, and designed circuits based upon these 
algorithms. However, each of these algorithms require a large number of 
registers, finite-field multipliers (FFM) and perhaps a finite-field inverters 

5 (FFI). Each of the FFMs and FFI translates into a hardware circuitry and 
real estate on an integrated circuit chip. Therefore, the goal here is to 
derive a method for solving of the polynomials in an efficient manner and 
to minimize the amount of circuitry required in the implementation of the 
algorithm. The number of registers and FFMs is typically a function of the 

10 variable t. Table 1 illustrates the authors of the architectures for correcting 
error-only codewords and the corresponding number of registers, FFMs 
and FFI: 



Reference 


Registers as a 
function oft 


FFMs as a 
function oft 


FFI 


Reed 


8t+2 


8t 


0 


Song 


6t+4 


6t+2 


0 


Wu 


7t+5 







TABLE 1 



From Table 1, Reed proposed the implementation of the inversionless 
15 Euclidean algorithm requires 8t+2 registers, 8t FFMs and no FFI in VISI 
Implementation of A Pipeline Reed-Solomon Decoder, IEEE Transaction 
on Computers, vol. C-34, pp. 393-403, May 1985. In addition, the article 
An Efficient Architecture for Implementing the Modified Euclidean 
Algorithm, the 9 th NASA Symposium on VLSI Design, Nov. 2000, Song 
20 demonstrated an architecture requiring 6t+4 registers and 6t+2 FFM's and 
no FFI. 

On the other hand, the article An Area-efficient Versatile 
Reed-Solomon Decoder for ADSL, IEEE International Symposium on 
Circuits and Systems, May 1999, Wu et al. presented the architecture 
25 reducing the number of FFMs but requiring the relatively complex FFI, 
which will limit speed and impose a significant hardware complexity. 

Therefore, it would be desirable to have an inversionless method and 
apparatus that requires no FFIs and minimizes the number of registers and 
FFMs in the implementation thereof. 
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SUMMARY OF THE INVENTION 



Accordingly, it is an object of the present invention to provide a 
method and apparatus for solving key equation polynomials in the 
decoding of codewords. Based upon the Euclidean algorithm, it can be 
implemented with minimal hardware circuitry. 

It is another object of the present invention to provide a method and 
apparatus for solving key equation within a t-step iterative decoding 
procedure while the prior art architectures require at most 2t iterations. 

It is yet another object of the present invention to provide a method 
and apparatus for solving key equation polynomials without decreasing the 
overall decoding speed of the decoder. 

Briefly, in a presently preferred embodiment, a method for computing 
error locator polynomial and error evaluator polynomial in the key 
equation solving step of the error correction code decoding process is 
presented whereby the polynomials are generated through at most t 
intermediate iterations that can be implemented with minimal amount of 
hardware circuitry. However, depending on the selected (N,K) code, the 
number of cycles required for the calculation of the polynomials would be 
within the time required for the calculation of upstream data. 

Additionally, a presently preferred embodiment for computing the 
error locator polynomial and the error value polynomial employs an 
efficient scheduling of a small number of registers and finite-field 
multipliers (FFMs) without the need of finite-field inverters (FFIs) is 
illustrated. Using these new methods, a new area-efficient architecture that 
uses only 4t+2p+4 registers and three FFMs and no FFIs is presented to 
implement the inversionless Euclidean algorithm. This method and 
architecture can be applied to a wide variety of RS and BCH codes with 
suitable code sizes. 

More specifically, the 3-FFM architecture of the presently preferred 
embodiment for solving key equation polynomials can also be utilized to 
calculate the Forney syndrome polynomial T(x) described above. This 
method and architecture can be applied to correct the error-only as well as 
the error-and-erasure codewords. 

An advantage of the present invention is that it provides a method and 
apparatus for solving key equation polynomials in the decoding of 
codewords. Based upon the Euclidean algorithm, it can be implemented 



with minimal hardware circuitry. 

Another advantage of the present invention is that it provides a 
method and apparatus for solving key equation polynomials within a 
t-iteration decoding procedure while other architectures require at most 2t 
5 iterations. It will maintain the overall decoding speed of the decoder. 

Yet another advantage of the present invention is that it provides an 
identical method and apparatus for not only solving key equation 
polynomials but also calculating the Forney syndrome polynomial T(x). It 
can be applied to the correction of errors as well as erasures. 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG.l illusteates the processing blocks in the decoding or codewords; 

FIG.2a~FIG.2c shows a three-FFM architecture of the preferred 
15 embodiment for calculating the errata evaluator polynomial, Q(x), in the 
key equation solver. 

FIG.3 shows a three-FFM architecture of the preferred embodiment for 
calculating the errata locator polynomial, a(x), in the key equation solver. 

Description of the Preferred Embodiments 
20 Firstly, we will show our modified decoding procedure requiring at 

most t iterations while the previous decoding procedure requires at most 2t 
iteration. Following the inversionless Euclidean algorithm is illustrated and 

the errata value(s) and errata location(s) produced by Q(x) and ct(x) in 

our inversionless decoding procedure are identical to the errata value(s) 
25 and errata location(s) founded by Q(x) and cr(x) in the original algorithm. 
Secondly, we decompose the inversionless Euclidean algorithm for 
reducing the number of registers to 4t+2p+4 and the number of FFMs to 3. 
Finally, we show the condition on N, K such that our architecture can be 
applied. 

30 The Euclidean Decoding Procedure 

For illustrating the Euclidean algorithm, we rewrite Q) as: 

Q(x) = x N " K Q(x) + T(x)^(x) (2) 

5 



where Q(x) is the quotient polynomial of x N * K and T(x)^(x), T(x)=S(x)A(x) 
is the Forney syndrome polynomial, and a(x)=A,(x)A(x) is the errata 
locator polynomial, which is the product of the error locator polynomial, 
X(x), and the erasure locator polynomial, A(x). Therefore, the errata 
5 evaluator polynomial, Q(x), can be calculated by the similar process of 
computing the GCD polynomial of x N " K and T(x) through the Euclidean 
algorithm, whose decoding process can be shown as follows: 

Q ( - !) (x) = x N - K 

Q (0) (x) = T <X> 
10 Q (1) (x) = Q H) (x) - Q (0) (x).Q (1) (x) 

Q®(x) = Q (i " 2) (x) - Q (M) (x).Q (i) (x) (3) 

Q (n) (x) = Q (n ' 2) (x) - Q (n - 1) (x)-Q (n) (x) 

15 where Q (i) (x) is the i-th quotient polynomial and Q (l) (x) is the i-th 
remainder polynomial. After n division operations, the n-th remainder 
polynomial, Q (n) (x), is assumed to be the errata evaluator polynomial Q(x). 
From the extended form of Euclidean algorithm introduced by 
Error-Control Coding for Data Networks, Kluwer Academic, 1999, the 

20 similar decoding process except the minor difference in the initial 
condition can be used to determine the errata locator polynomial a(x), 
which is also described as follows: 

a (0) (x) = A(x) 

g (1) (x) = a (0) (x) -Q (1) (x) 

25 a (2) (x) = a (1) (x) >Q (2) (x) + a (0) (x) 

a (i) (x) = a (M) (x) <Q (i) (x) + a (i " 2) (x) (4) 

a (n) (x) = a (n - 1} (x) -Q (n) (x) + a (n " 2) (x) 
30 where A(x) = f^(1 + a'x) represents the erasure locator polynomial and A 
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is the erasure set. Note that all Q (i) (x) here are equivalent to the i-th 
quotient polynomial Q (i) (x) in (3). Similarly, after n iterations, a (n) (x) is 
assumed to the errata locator polynomial, a(x). From (3) and (4), it can be 
shown that the sum of deg(Q (i) (x)) and deg(a (5+1) (x)) equals to a constant 
5 number, N-K+s, where s is the number of actual erasures and hence, equals 
the degree of A(x). 



Our modified decoding procedure 

The proposed modified decoding procedure calculating the quotient 

polynomial with degree one in advance is shown as follows: 
M io Initial condition 

1 A (0) (x)=x N - K , Q (0) (x)=M (0) (x)=T(x) 

| a (0) (x)=0 , a (0) (x)=m (0) (x)=A(x) 

t For( i=0 to t ) 

SJ 5=deg(A (i) (x)) , A=deg(M (l) (x)) 

° 15 if(deg(a (i) (x))< A) 



25 



30 



A, 



(i) 



<(*> M ,„ 

q 0 (i) (x) = 0 forS=A 

MO) a (0 , M «a 0) 
a w (x) = a + M A _, A 8 for8^A 

Q (i+1) (x) = A (i) (x) + x 5 - A - 1 *M (i) (x)«q (i) (x) (5) 
a (i+1) (x) = a (i) (x) + x 5 - A - , «m (i) (x)«q (i) (x) (6) 
if(8-2<A) 

A (i+1) (x)=Q (i) (x) , M (i+1) (x)=Q (i+1) (x) 

a (i+1) (x)=a (i) (x) , m (i+1) (x)=a (i+1) (x) 

else 

A (i+1) (x)=Q (i+1) (x) , M (i+1) (x)=M (i) (x) 
a (i+1) (x)=a (i+1) (x) , m (i+1) (x)=m (i) (x) 

else 

Q(x)=Q (i) (x) , a(x)=a (5) (x) Finish 

where q (i) (x)=q! (i) +q 0 (i) is the i-th iteration quotient polynomial; Ag (l) and 
M A (i) are the leading coefficients of A (i) (x) and M (i) (x), respectively. A (l) (x) 
and M (i) (x) are the i-th iteration auxiliary polynomial for computing the i-th 
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errata evaluator polynomial Q (i+1) (x) and similarly, a (i) (x) and m (l) (x) are the 
i-th iteration auxiliary polynomial to calculate the i-th errata locator 
polynomial a (i+1) (x). Note that If there are only errors, the erasure locator 
polynomial, A(x) equals 1 and the Forney syndrome polynomial, T(x) 
should be altered to the syndrome polynomial S(x). 

As compared with (3), if we assume the previous remainder 
polynomial Q (i " !) (x) is equivalent to A (i) (x) and the remainder polynomial 
Q (i) (x) is equivalent to M^x), the difference in degree between Q (l) (x) and 
Q (M) (x) equaling 5-A implies the decoding procedure shown above will 



polynomial Q (i+1) (x). 

Note that our modified decoding procedure will stop at deg( Q (l) (x) ) < 
deg( a (i) (x) ) and in the meantime, a (i) (x) is the errata locator polynomial 
a(x) with degree of s+v. That s and v represent the number of actual 
erasure(s) and error(s). Recalling deg(a (0) (x)=deg(A (i) (x))=s, the degree of 
a (i) (x) will increase from s to s+v. In a specific case with degree of Q (l) (x) 
in (3) all equaling one, v division operations are needed and in the 
decoding procedure shown above, the total number of iterations is v as a 
result that accomplishing each division operation takes 1 iteration with 
5-A=deg(q (i) (x))=l. Owing to v<t, the modified decoding procedure above 
requires at most t iterations for solving key equation polynomials. 

The inversionless decoding procedure 

For eliminating the inverse operation within our modified decoding 
procedure, a novel inversionless decoding procedure is proposed and 
25 shown as follows: 



take at most 




calculate the next remainder 



Initial condition: 



A (0) (x) = x' 



,N-K 



Q (0) (x) = M (0) (x) = T(x) 



a< 0) (x) = 0 



a (0) (x) = m (0) (x) = A(x) 



For ( i=0 to t ) 
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8 = deg(A (i) (x)) 



A = deg(M (i) (x)) 



if(deg(d (i) (x))<A) 



q«(x) = A 6 w M A (i > 






q 0 (i) (x) = 0 


for 8*A 




6 (i) (x) = M (i) A (i) 

MO \ A / IVI A "8-1 


' +M A _ 1 (i) A 6 * (i) for5=A 




Q (i+1) (x) = M A (i) M A (i) 


•A (i) (x) + x 5 - A - 1 -M (i) (x)-q (i) (x) 


(7) 


d< i+1 >(x) = M A (i) M A (i) 


• a (i) (x) + x 5 - A - 1 -rh (i) (x)-q (i) (x) 


(8) 


if( 8-2 < A ) 






A (i+1) (x) = Q (i) (x) 


M (i+1) (x) = Q (l+1) (x) 




a (i+1) (x) = d (i) (x) 


m (i+1) (x) = a (i+1) (x) 





else 

10 A (l+1) (x) = Q (i+1) (x) , M (l+1) (x) = M (i) (x) 

a (i+1) (x) = d (i+1) (x) , m (i+1) (x) = m (i) (x) 

else 

fi(x) = Q (i) (x) , d(x) = d (i) (x) Finish 

where Q(x) and d(x) are the modified errata evaluator polynomial and 

15 errata locator polynomial, respectively, It can be shown that <r(x) and 
Q(x) can be used to find the same error location(s) and error value(s) as 
the original a(x) and Q(x) do. While compared with other approaches, our 
proposed inversionless Euclidean algorithm not only eliminates the costly 
inversion operation but also introduces a t-iteration decoding procedure. 

20 The decomposed Architecture 

Here we propose a decomposed architecture from the proposed 
inversionless Euclidean algorithm, which works with individual 
coefficients of the polynomial instead of the entire polynomial as a whole. 
As shown above, A (l) (x) and M (l) (x) can be assumed to the previous 



remainder polynomial, Q (M) (x), and the present remainder polynomial, 
Q (i) (x), respectively. The proposed decoding procedure calculates the i-th 

quotient polynomial q (i) (x) as q 0 (l> + q^x for eliminating at least one 

degree in the division operation and (7)~(8) can be rewritten as the 
5 following two equations: 

n<w)( x ) = Q A (i) Q A (i) .n( i - 1 >(x) + x 5 - A - 1 -Q (i) (x)-q (i) (x) (9) 

a (i+1) (x) = Q A (i) Q A (i) -a (i - 1) (x) + x 5 - A - 1 -a (i) (x)-q (i) (x) (10) 

While decomposing these two equations, we let 5-A=l without loss of 
generality and provide following definitions: 

10 =Q A (i) Q A (i) -Q™ +Q® -q 0 (i) +Q H m -q? 0 < j < 5-2 (11) 

d x (i+1) = Q A (i) Q A (i) • a™ + a x (i) • q 0 (i) + a M m • 0 < X < qH-1 (12) 

where Q. (i+1) and a^ (i+1> corresponds to the j-th and A,-th coefficient of 
Q (i+1) (x) and ct (1+1) (x) at the i-th iteration; Q (i+1) (x) = Q 0 +Q 1 x + ... + Q 8 _ 2 x 5 " 2 , 
8 being the degree of Q (M) (x) and a (i+1) (x) = a 0 +a 1 x + ... + a v+1 x v+1 , cp being 
15 the degree of d {i) (x) . Note that the sum of 8 and cp equals N+K+s, which is 
the sum of deg(Q (M) (x))+deg(d (i) (x)), From (11)~(12), if Q A (i) Q A (i) , q 0 (i) 
and can be calculated in advance, there only three finite-field 

multipliers (FFMs) needed to compute Q j (W) and . The detailed 

cycle operation of our inversionless decomposed architecture can be seen 
20 in Table 1. 
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TABLE 2 



It is evident from Table 1 that, at cycle j=0, the computation of Q 0 (l+1) 

requires Q A (i> Q A (i) and q 0 (i) , which have been calculated at the 

5 initialization cycle. Similarly, at cycle j>l, the computation of also 

requires ,which has been calculated at cycle j=0. Note that each cycle 
needs three finite-field multiplications and the calculations process of 
a (i+1) (x) is similar to that of Q (i+1) (x). 

The inversionless decomposed Euclidean algorithm shown above 
10 suggests a 3-FFM implementation of the Key Equation Solver, which is 
illustrated in Fig.2. The branch labeling in Fig.2 corresponds to a particular 

time instance while computing Q (i+1) (x) . As compared with Table 1, 



n 



Fig.2(a) shows the initialization cycle and Fig.2(b) indicates the calculation 
cycle for q, (i) and Q 0 (i+1) . The process for computing other coefficients of 

Q (i+1) (x) is expressed in Fig.2(c). Because the computation process of 

d (i+1) (x) is similar to that of Q (i+1) (x), the hardware used to compute 

Q (i+1) (x) can be reconfigured to calculated d (l+1) (x), which is presented in 
Fig.3. 

This architecture can be used for error-only correction as well as 
error-and-erasure correction. Compared to existing proposals requiring 6t 
to 8t FFMs, the preferred embodiment of the present invention 
significantly reduces hardware complexity down to 3 FFMs. However, in 
order to finish the i-th iteration, the architecture of the preferred 
embodiment requires 8+\|/+l cycles whereas prior art architectures requires 
only two to three cycles. 

The additional time required for generating the data under the 
architecture of the present invention does not slow down the overall system 
processing speed. One reason here is the overall system processing speed is 
dominated by syndrome calculation and Chien Search, not key equation 
solver illustrated in Fig. 1 . Therefore, our architecture slowing down the 
Euclidean algorithm (till taking N cycles) will not impact the decoding 
speed. 

Additionally, the method and apparatus of the present invention also 
minimize the amount of required registers. As shown earlier, 

deg(a (i) (x) )+deg(Q (i+1) (x) )=N-K+s<2t+p, where t and p represent the 

number of errors and erasures in the decoding of codewords and 
consequently, in the preferred embodiment of the present invention, 

2t+p+2 registers are used to store the coefficients of Q (M) (x) and d (l) (x), 

and another 2t+p+2 registers used for storing the coefficients of Q (,) (x) 

and 6 (i+1) (x). Hence, calculating Q (i+1) (x) and 6 (i+2) (x) iteratively totally 
requires 4t+2p+4 registers and if there are only errors corrected, the 
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amount of required registers is 4t+4 and the previously proposed 
architectures requiring 6t to 8t registers. 

Furthermore, the preferred embodiment of the present invention can 
also be used to calculate the Forney syndrome polynomial, T(x), which is 
5 defined as: 

T(x)=S(x)A(x) mod x N " K (13) 

where A(x) = fj(1 + XjX) is the erasure locator polynomial and Xj is the 

i=i 

j-th erasure magnitude. T(x) can be obtained by following procedures: 
Initial condition 

10 T (0) (x) = S(x) 

For( i=0 to t ) 
if( 2i < s ) 

A (i) (x) = (1 + X2iX)(1 + X 2i+ iX) < 14 > 
T (i+1) (x) = T (i) (x)-A (i) (x) modX N " K (15) 

15 else 

T (i) (x) = T(x) Finish 
where A (i) (x) is the I-th auxiliary polynomial for computing the i-th 
iteration Fonrey syndrome polynomial, T (i+1) (x). Note that A (i) (x) can be 
expressed as 1 + A 1 (i) x + A 2 (i) x 2 and T (i+1) (x) can be decomposed as the 
20 following results: 

T T (i+1) =T T (i) +T T _ 1 (i) • A, (i) +T T _ 2 (i) -A 2 (i) 0 < t < N-K-l (16) 

It is evident that the process calculating the T-th coefficient, T T (I+1) , is 

very similar to that in (11) and therefore, the 3-FFM architecture can be 
used to obtain the Forney syndrome polynomial, T(x). 

25 Application Conditions 

The total number of cycles required to compute d(x) and Q(x) 

using the 3-FFM architecture of the preferred embodiment is of interest in 
considering the potential impact on the overall system performance. From 

13 



the proposed iterative decoding process, 0<j<5-2 in (11) and 0<X<\\/+1 in 
(12) implying the number of cycles required to compute Q (i+1) (x) is 8-1 

and calculating a (i+1) (x) needs \{/+2 cycles in the i-th iteration. However, 

one more cycle is needed to get q/ 0 and q 0 (l) , and the proposed decoding 

5 procedure requires 8+\j/+2 cycles in one iteration totally. Note that 
8+\j/=N-K+s<2t+p. For RS (N,K) code of correcting t errors and p erasures, 
the total number of cycles required in our t-iteration decomposed 
inversionless architecture is less than 2t 2 +pt+2t. 

Table 3 shows the maximum number of cycles for different RS (N,K) 

10 codes with N-K ranging from 4 to 16. If N is larger than the number of 
cycles required, then our 3-FFMs architecture can be applied to reduce the 
h ardware complexity while maintaining the overall decoding speed. 
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TABLE 3 

There are many applications of BCH and RS codes in 
15 communications and storage systems that benefit from methods and 
apparatus of the present invention. For example, Digital Versatile Disks 
(DVDs) use a RS product code which is (182,172) in the row direction and 
(208,192) in the column direction. Digital TV broadcasting uses a (204,188) 
RS code. CD-ROM uses a number of smaller (32,28) and (28,24) RS codes. 
20 In the optical fiber submarine cable systems, RS (255,239) code is used 
and standardized to provide burst error correcting capability. In wireless 
communications, the AMPS cellular phone system uses (40,28) and (48,36) 
binary BCH codes, which are shortened codes of the (63,51) code. The 
(63,51) code, which can correct up to 2 errors (N-K=12,m=6), requires 
25 fewer than 12 cycles (t=2, row 1 of Table 3). All of these applications, as 
well as many others, can benefit from the method and apparatus of the 

14 



present invention. 

Although the present invention has been described in terms of specific 
embodiments, it is anticipated that alterations and modifications thereof 
will no doubt become apparent to those skilled in the art. It is therefore 
intended that the following claims be interpreted as covering all such 
alterations and modifications as fall within the true spirit and scope of the 
invention. 
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